In the Claims

Please find below a listing of all of the pending claims. The statuses of the claims are
set forth in parentheses.

1. (Currently amended) A computerised method for measuring the degree of
coherence of the arrangement of nodes in a hierarchy comprising the steps of:

a) receiving a hierarchy of nodes; 

b) receiving a plurality of training cases that are filed under said nodes; and 

c) responsive thereto for determining a measure of coherence, for at least one node 
that has a local environment, by evaluating the training cases under the node with respect to 
the training cases in the local environment of the node.

2. (Original) The method of claim 1 wherein the step of determining a measure of
coherence includes the steps of

determining, for the subtree at the node, the number of the training cases and the
average prevalence of each feature in the training cases;

determining, for the local environment of the node, the number of the training cases
and the average prevalence of each feature in the training cases;

determining predictive features that distinguish the subtree of the current node from
the local environment of the node; and

generating a coherence value for the current node based on the average prevalence of 
at least one predictive feature.
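
The steps of claim 2 can be illustrated with a minimal Python sketch. It is not taken from the application: the representation of a training case as a set of feature names, the helper names, the prevalence-gap test for predictiveness, and the 0.5 threshold are all assumptions made for illustration. The claim leaves the selection metric and the exact form of the coherence value open.

```python
from typing import Dict, List, Set

# Assumption: a training case is modelled as the set of features it contains.
Case = Set[str]


def average_prevalence(cases: List[Case]) -> Dict[str, float]:
    """Fraction of the given cases in which each feature occurs."""
    counts: Dict[str, int] = {}
    for case in cases:
        for feature in case:
            counts[feature] = counts.get(feature, 0) + 1
    return {f: c / len(cases) for f, c in counts.items()} if cases else {}


def predictive_features(subtree: List[Case], local_env: List[Case],
                        min_gap: float = 0.5) -> List[str]:
    """Features far more prevalent in the subtree than in the local environment.

    The prevalence-gap test and its 0.5 threshold are hypothetical stand-ins
    for whichever predictiveness metric is actually used.
    """
    sub = average_prevalence(subtree)
    env = average_prevalence(local_env)
    return sorted(f for f, p in sub.items() if p - env.get(f, 0.0) >= min_gap)


def coherence(subtree: List[Case], local_env: List[Case]) -> float:
    """One possible coherence value: the highest subtree prevalence
    among the predictive features (0.0 if there are none)."""
    sub = average_prevalence(subtree)
    chosen = predictive_features(subtree, local_env)
    return max((sub[f] for f in chosen), default=0.0)
```

Under these assumptions, a node whose cases all share a feature that is absent from the local environment receives the highest possible value, and a node with no distinguishing feature receives zero.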

3. (Original) The method of claim 2 further comprising the steps of

determining, for each said predictive feature, the degree of uniformity of the
prevalence of the predictive feature among the children subtrees of the node;

and wherein the step of generating a coherence value for the current node is based on 
said degree of uniformity and the average prevalence of at least one predictive feature.

4. (Original) The method of claim 1 wherein the hierarchy of nodes includes a topic
hierarchy; wherein the nodes are topics; and wherein the training cases includes one of
labeled documents and feature vectors assigned to the topics. 

5. (Currently amended) The method of claim 2 wherein the predictive features
include[s] at least one of words, multi-word phrases, noun phrases, document length, file
extension type, and other parameters related to documents[, and a combination thereof].

6. (Currently amended) The method of claim 2 wherein the step of determining the
predictive features includes the step of 

computing at least one of information-gain metrics, mutual-information metrics,
Chi-Squared, Fisher's Exact Test, lift, odds-ratio, word frequency among documents, and word
frequency among all words in all documents[, and a combination thereof].
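
As an illustration of the first metric listed in claim 6, the following sketch computes the information gain of a single binary feature for separating subtree cases from local-environment cases. The function names and the four-count interface are assumptions made for this example; the claim does not prescribe any particular implementation.

```python
import math


def entropy(a: int, b: int) -> float:
    """Shannon entropy, in bits, of a two-way split with counts a and b."""
    total = a + b
    h = 0.0
    for n in (a, b):
        if n:
            p = n / total
            h -= p * math.log2(p)
    return h


def information_gain(sub_with: int, sub_without: int,
                     env_with: int, env_without: int) -> float:
    """Reduction in subtree-versus-environment entropy from knowing
    whether a case contains the feature.

    sub_with / sub_without: subtree cases with / without the feature.
    env_with / env_without: local-environment cases with / without it.
    """
    total = sub_with + sub_without + env_with + env_without
    if total == 0:
        return 0.0
    n_with = sub_with + env_with
    n_without = sub_without + env_without
    before = entropy(sub_with + sub_without, env_with + env_without)
    after = ((n_with / total) * entropy(sub_with, env_with)
             + (n_without / total) * entropy(sub_without, env_without))
    return before - after
```

A feature present in every subtree case and in no local-environment case yields the full one bit of gain for equally sized groups, while a feature spread evenly over both groups yields none.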

7. (Original) The method of claim 3 wherein the step of selecting features that are
uniformly common includes
the step of computing one of the metrics cosine-similarity, projection, and Chi-Squared
between the average feature prevalence vector and the vector of training case counts across
the subtopics of the current node. 
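
The cosine-similarity option of claim 7 can be sketched as follows. The interpretation of the two vectors is an assumption: each entry of the first vector is taken to be the number of cases in one child subtopic that contain the feature, and each entry of the second is that child's total training case count. Under that reading, a feature spread evenly over the children produces proportional vectors and a similarity near 1.

```python
import math
from typing import Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical node with three child subtopics holding 10, 20 and 40 cases.
case_counts = [10, 20, 40]
uniform_feature = [5, 10, 20]    # present in half of the cases of every child
lopsided_feature = [10, 0, 0]    # present only in the first child

uniform_score = cosine_similarity(uniform_feature, case_counts)
lopsided_score = cosine_similarity(lopsided_feature, case_counts)
```

Here `uniform_score` is close to 1 and `lopsided_score` is much lower, so a threshold on the score could separate uniformly common features from those concentrated in a single child.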

8. (Original) The method of claim 3 wherein the step of generating a hierarchical
coherence number includes the step of

generating a hierarchical coherence number by computing the average prevalence of
the predictive feature with the greatest degree of uniformity.

9. (Original) The method of claim 3 wherein the step of generating a hierarchical
coherence number includes the step of

generating a hierarchical coherence number by computing a weighted-average of the
average prevalence of at least two features that are selected as both predictive and uniform.

10. (Original) The method of claim 9 wherein the step of generating a hierarchical
coherence number includes the step of

generating a hierarchical coherence number by computing a weighted-average of the
average prevalence of the top k most prevalent features that are selected as both predictive
and uniform, wherein k is a predetermined positive integer.

11. (Original) The method of claim 9 wherein the weighted-average employs as the
weighting schedule one of the negative exponential function exp(-I) and the inverse rank
function (1/I), where I is the ordered rank of the most prevalent features that are selected as
both predictive and uniform.
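
The weighting schedules of claims 9 through 11 can be sketched as below. The function names are hypothetical, ranks are assumed to start at 1 for the most prevalent feature, and the weights are assumed to be normalised by their sum; none of these details is stated in the claims.

```python
import math
from typing import Callable, Optional, Sequence


def exp_decay(rank: int) -> float:
    """Negative exponential schedule exp(-I)."""
    return math.exp(-rank)


def inverse_rank(rank: int) -> float:
    """Inverse rank schedule 1/I."""
    return 1.0 / rank


def weighted_coherence(prevalences: Sequence[float],
                       weight: Callable[[int], float] = exp_decay,
                       k: Optional[int] = None) -> float:
    """Weighted average of feature prevalences, most prevalent first.

    prevalences: average prevalence of each feature already selected as
                 both predictive and uniform.
    k:           if given, only the top k most prevalent features are used.
    """
    ranked = sorted(prevalences, reverse=True)
    if k is not None:
        ranked = ranked[:k]
    weights = [weight(rank) for rank in range(1, len(ranked) + 1)]
    total = sum(weights)
    return sum(w * p for w, p in zip(weights, ranked)) / total if total else 0.0
```

For example, with prevalences 0.9 and 0.6 and the inverse rank schedule, the weights are 1 and 1/2, giving (0.9 + 0.3) / 1.5 = 0.8. Both schedules favour the most prevalent feature, with exp(-I) decaying faster than 1/I.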

12. (Original) The method of claim 3 wherein the step of generating a hierarchical
coherence number includes the step of

generating a hierarchical coherence number by computing the average value of the
average prevalence of the top k most prevalent features that are selected as both predictive
and uniform, wherein k is a predetermined positive integer.

13. (Original) The method of claim 2 wherein the step of generating a hierarchical
coherence number includes the step of

generating a hierarchical coherence number by employing a maximum, over all
predictive features, of a projection between the average feature prevalence vector and the 
vector of training case counts across the subtopics of the current node. 

14. (Original) The method of claim 2 wherein the step of generating a hierarchical
coherence number includes the step of

generating a hierarchical coherence number by employing a maximum average prevalence of
the predictive features. 
15. (Original) The method of claim 1 further comprising the step of:
assigning an aggregate-coherence value to a node in the hierarchy, based on an aggregation
function of said determined measure of coherence over said node and of descendants of said
node. 

16. (Original) The method of claim 15 wherein the aggregation function includes
one of a sum, average, weighted-average, minimum function, and maximum function.
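
The aggregation of claims 15 and 16 can be sketched with a small tree walk. The `Node` class and function names are hypothetical and exist only for this example; the aggregation function is passed in so that sum, average, minimum, or maximum can be swapped freely.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Callable, Iterator, List


@dataclass
class Node:
    """Hypothetical hierarchy node carrying its own measure of coherence."""
    coherence: float
    children: List["Node"] = field(default_factory=list)


def subtree_values(node: Node) -> Iterator[float]:
    """Yield the coherence of the node and of every descendant."""
    yield node.coherence
    for child in node.children:
        yield from subtree_values(child)


def aggregate_coherence(node: Node,
                        aggregate: Callable[[List[float]], float] = min) -> float:
    """Apply an aggregation function over the node and its descendants."""
    return aggregate(list(subtree_values(node)))


# Example hierarchy: a root with two children, one of which has a child.
root = Node(0.8, [Node(0.6, [Node(0.4)]), Node(1.0)])
worst = aggregate_coherence(root, min)
best = aggregate_coherence(root, max)
typical = aggregate_coherence(root, mean)
```

Using the minimum highlights the least coherent node anywhere below the root, whereas the average gives an overall picture of the subtree.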

17. (Original) The method of claim 1 further comprising the step of:

using the coherence values of one or more nodes to modify the structure of the
hierarchy to improve the coherence of the hierarchy.

18. (Original) The method of claim 1 further comprising the step of:

using the coherence values of one or more nodes to guide the selection of training 
cases for an automated classifier. 

19. (Original) The method of claim 1 further comprising the step of:

using the coherence values of one or more nodes to select a suitable classification
technology to be employed to automatically classify items in the hierarchy.

20. (Currently amended) [An] A computerised apparatus for measuring the degree of
coherence of at least one considered node that has associated therewith a subtree and a local 
environment in a hierarchy comprising: 
a) a training case counter for determining the number of training cases under the
subtree and the number of training cases for the local environment;

b) an average prevalence determination unit for determining for at least one feature
the average prevalence under the subtree and the average prevalence for the local
environment; and

c) a predictive feature determination unit for determining the set of predictive features
that distinguish training cases of the subtree from documents of the local environment; and 

d) a coherence assignment unit for generating a coherence metric number for each 
considered node based on at least one predictive feature. 

21. (Currently amended) The apparatus of claim 20 further comprising:

[a_5)] a subtopic uniformity determination unit for determining the uniformity of the
distribution of said predictive features among the children subtopics of the considered node;

wherein the coherence assignment unit generates a coherence metric number based on
at least one predictive feature that is determined to be uniformly distributed among said
children subtopics.

22. (Currently amended) A computerised system for measuring the degree of
coherence of nodes in a topic hierarchy comprising:

a) a coherence analyzer unit for receiving the topic hierarchy and a set of labeled 
training cases and responsive thereto for determining, for at least one current node under
consideration, a measure of coherence by evaluating the training cases and at least one



PATENT Atty Docket No.: 10008388-1 

App. Ser. No.: 10/066,096

feature under the local environment of the current node and by evaluating the training cases 
and at least one feature under the subtree of the node under consideration.

23. (Original) The system of claim 22 further comprising:

b) a user interface presentation unit coupled to the coherence analyzer unit for
displaying a measure of coherence for one or more current nodes under consideration.

24. (Original) The system of claim 22 further comprising:

b) feature extractor coupled to the coherence analyzer for receiving a set of labeled 
documents and at least one feature guideline and responsive thereto for generating the set of
labeled feature vectors. 

25. (Original) The system of claim 22 wherein the coherence analyzer further
comprises:

a_1) a training case counter for determining the number of training cases under each
node subtree;

a_2) an average prevalence determination unit for determining the average prevalence
for at least one feature under each node subtree; and

a_3) a predictive feature determination unit for determining predictive features under
each node subtree; and

a_4) a coherence assignment unit for generating coherence metric number based on at
least one predictive feature. 
26. (Original) The system of claim 25 wherein the coherence analyzer further
comprises:

a_5) a subtopic uniformity determination unit for determining the degree of
uniformity in the distribution of one or more said predictive features among the children of
the current node;

wherein the coherence assignment unit generates a coherence metric number based on
at least one uniform predictive feature.

27. (Original) A computerised method for measuring the degree of coherence for
one or more current nodes in a hierarchy comprising the steps of:

a) receiving a hierarchy and the training cases filed into said hierarchy;

b) determining a list of predictive features that distinguish documents of the current
node's sub-tree from those in the current node's local environment;

c) assigning a coherence value to the current node based on the list of predictive
features and based on one or more of their degree of predictiveness, their degree of
prevalence, and their degree of uniformity, wherein the degree of uniformity reflects how
evenly distributed said predictive features are among the children subtrees of the current node 
based on the training cases under each child subtree. 